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ON The data accumulated by complete and dense surveys of 
the nearby Universe strongly suggest that galaxies tend to 

Q b le along thin wall-like structures surrounding large empty 
I regions (Geller & Huchra 1989, da Costa et. al. 1994). More- 

Q over, voids with typical size of about 5000 kms -1 appear 
to be relatively common features of the galaxy distribution 

^ as they are seen in different directions of the sky. Deeper 
surveys of a cross-section of the galaxy distribution out to 
about 50000 kms -1 , like the Las Campanas Survey (Schect- 
man et. al. 1992), lend further support to these findings. The 
qualitative picture that emerges is of a close-packed, volume- 
filling network of voids. Deep pencil-beam surveys also seem 
to agree with this cellular-like picture except that they de- 
tect structures on a scale of about 128 ft -1 Mpc (Broadhurst 
et. al. 1990). 

Despite the growing evidence that large voids and 
Great- Wall like features are common very little has been 
done in terms of quantifying their frequency and distribu- 
tion of sizes, primarily due to the intrinsic difficulty of iden- 
tifying voids as well-defined entities. On the other hand, 
the existence of large voids or, if one prefers, extended and 
thin coherent wall-like structures surrounding large empty 
regions is puzzling and their frequency may pose a serious 
challenge to theories of structure formation. To our knowl- 
edge, with the exception of a few case studies (e.g. Zeng & 
White 1991), there has not been a systematic detailed study 
of this question within the context of N-body simulations. 

An attempt to quantitatively understand the con- 
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straints that the existence of large voids may impose on 
theories of structure formation was made by Blumenthal et. 
al. (1992) and Piran et. al. (1993), assuming that the present- 
day voids are the result of the evolution of proto-voids. Ac- 
cording to this scenario the voids seen in the galaxy distri- 
bution are a natural consequence of the gravitational evolu- 
tion of primordial underdense regions which expand in time. 
Numerical experiments (Regos & Geller 1991, Dubinski et. 
al. 1993, Van de Weygaert & Van Kampen 1993, Einasto et. 
al. 1994, Sahni et. al. 1994), together with simple theoretical 
and statistical arguments (Blumenthal et. al. 1992), suggest 
that the evolution of underdense regions may lead to a natu- 
ral explanation for the geometrical properties of the matter 
and galaxy distribution. The models suggest that the galaxy 
distribution would be characterized by the size (diameter) of 
voids reaching shell-crossing today. These voids would also 
be the largest ones, filling a large fraction of the volume, 
which would be surrounded by high-density contrast walls 
(Dubinski et. al. 1992). 

The similarity between the predicted distribution in 
these models and the observed galaxy distribution under- 
scores the need for a more detailed understanding of the 
nature of the galaxy distribution. In particular, one would 
like to know how to determine whether the observed galaxy 
distribution is in fact void-filled, whether there is a charac- 
teristic scale for the observed voids and how successful differ- 
ent cosmogonies are in producing the abundance of observed 
voids. 

Unfortunately, until now no suitable statistics exist to 
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address all aspects of these issues. In contrast to overdensi- 
ties, which collapse to form small bound entities, voids are 
large and difficult to characterize in a point distribution. 
High-order statistics like the VPF (e.g. Lachieze-Rey et. 
al. 1992), and variants (Vogeley 1993 and references therein), 
have been used in the past. However, these statistics are sen- 
sitive to shot-noise, probe relatively small scales and are not 
convenient for comparing different samples as they depend 
on the mean density of the sample (see e.g. Little & Wein- 
berg 1993). Similar problems are present in other statistics 
like nearest-neighbor distribution (Ryden and Turner 1984). 
The most useful void statistic available at present appears 
to be the void spectrum analysis developed by Kauffmann 
& Fairall (1991), with which they managed to identify a 
large number of voids in the galaxy distribution, and which 
was shown to be a good indicator of the characteristic void 
"size" in N-body simulations (Kauffmann & Melott 1992). 
However, the method makes a priori assumptions about void 
topology. The genus analysis of the smoothed density field, 
introduced by Gott, Weinberg and Melott (1987), might 
shed more light on that issue. However, so far no conclu- 
sive results have been obtained, although some interesting 
conclusions could be drawn on the structure of voids in grav- 
itational instability scenarios when combining the topology 
analysis with the void spectrum (Sahni, Sathyaprakash & 
Shandarin 1994). 

Here we explore the possibility of using the two-point 
correlation function £ (r) for determining the maximum lin- 
ear extent of voids or diameters in case of spherical voids. 
The reason for returning to the two-point correlation func- 
tion is that it is a simple statistic, well-suited for a point pro- 
cess, independent of topology or other a priori assumptions, 
while for a cellular like distribution the first-zero crossing is 
a direct measure of the characteristic size of the cells. For 
instance, for a cubic lattice with a cell size L the correlation 
function can be expressed analytically and the first zero- 
crossing occurs at L/2 (e.g. Heavens 1985, also see Van de 
Weygaert, 1991, ch. 4). Furthermore, preliminary calcula- 
tions utilizing data from several different new surveys sug- 
gest that the scale for zero-crossing is close to 25 ft -1 Mpc, 
consistent with voids of 5000 kins -1 in diameter. Unfortu- 
nately, it is well-known that £(r) is very uncertain on large 
scales due to sampling fluctuations and uncertainties in the 
adopted normalization. The question we investigate is how 
large these uncertainties are for a given survey geometry 
and, considering the errors, if the first zero-crossing can be 
used to discriminate between different scales. 

In this investigation we make extensive use of the 
Voronoi tessellation which is physically motivated and 
should be at least a fair representation of the skeleton of the 
LSS. We utilize a Monte Carlo approach using the Voronoi 
tessellation as the underlying distribution to determine the 
errors in £ (r) and the statistical properties of the first zero- 
crossing from data subject to selection effects and finite sam- 
pling. 

In section 2 we describe the toy model we have used 
in our Monte Carlo experiments. In section 3 we investi- 
gate some specific examples of surveys. Our conclusions are 
summarized in section 4. 



2 MONTE CARLO SIMULATIONS 
2.1 Model for the LSS 

Most theories of the formation of structure on large scales 
are based on the gravitational instability scenario (e.g. Pee- 
bles 1980). In such scenarios voids are a natural consequence 
of the evolution of primordial underdense regions. Under- 
dense regions expand rapidly while overdense regions col- 
lapse (Icke 1984, but see Babul & Van de Weygaert 1994). 
Thus, eventually the large underdense regions will become 
the dominant morphological and, in many cases, dynami- 
cal component of the Universe. This crucial role of voids 
was already stressed more than a decade ago by Zel'dovich, 
Einasto & Shandarin (1982). Studies of the gravitational 
growth of proto- voids lead to a scenario of a void-filled uni- 
verse with voids having a characteristic scale set by those 
voids shell-crossing today (Blumenthal et al. 1992; Dubin- 
ski et al. 1993). Therefore, it seems reasonable to construct 
a model of the mass distribution in the Universe based on 
the evolution of the low-density regions. The results of the 
numerical experiments (Regos & Geller 1991, Dubinski et. 
al. 1993, Van de Weygaert & Van Kampen 1993, Little & 
Weinberg 1993, Einasto et. al. 1994, Sahni et. al. 1994) sug- 
gest that matter should flow away from the centers off the 
proto-voids until it encounters similar material flowing out 
of an adjacent proto-void. If, at a given epoch, the excess 
expansion in the dominant shell-crossing voids is rather sim- 
ilar, an assumption supported by the study of Dubinski et. 
al. (1993), matter will collect on planes that perpendicularly 
bisect the axes connecting the expansion centers. This pro- 
cess leads to the formation of a geometrical structure which 
resembles a Voronoi tessellation. Based on these consider- 
ations we here assume that the skeleton of the matter dis- 
tribution is approximately represented by a Voronoi tessel- 
lation (Van de Weygaert 1991b, 1994). The same physical 
motivation has led several authors to use this distribution 
to model the LSS produced in the explosive scenario (e.g. 
Yoshioka & Ikeuchi 1989). 

The use of Voronoi tessellations has recently become 
quite popular. After the earlier work of Kiang (1966) and 
Matsuda & Shima (1984), the statistical properties of 2-d 
foams were extensively examined by Icke & Van de Wey- 
gaert (1987). Subsequently, Van de Weygaert & Icke (1989) 
developed a geometrical 3-D Voronoi algorithm as well as 
an early version of a code to distribute particles within the 
walls, edges and vertices of the Voronoi network according 
to a heuristic prescription of the evolution of the large scale 
matter distribution. Amongst others, they found that the 
two-point correlation function of Voronoi vertices has a slope 
and amplitude that is strikingly similar to the one displayed 
by rich Abell clusters. Since then there has been an increased 
interest in the use of Voronoi tessellations as a useful descrip- 
tion of the large scale structure (Coles 1990; Van de Wey- 
gaert I99Ia,b; Ikeuchi & Turner 1991; SubbaRao & Szalay 
1992; Williams 1992; Williams et al. 1991). The main virtue 
of the Voronoi foam is that it provides a conceptually sim- 
ple model for a cellular or foamlike distribution of galaxies, 
whose ease and versatility of construction makes it an ideal 
tool for statistical studies. Its usefulness in this context is 
underlined by the fact that it is physically motivated and 
resembles the observed distribution composed of sheets, fil- 
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aments, and clusters surrounding voids. Although the model 
cannot say much about the pattern of the galaxy distribu- 
tion on small scales, it is nevertheless a useful prescription 
for the spatial distribution of the walls themselves. 

One might argue that the ideal approach to calculating 
errors in £ would be to do a large series of N-body simulation 
of a model with power spectrum well matched by the real 
data, and use them to define errors and covariances in the 
correlation function. However, this approach is computation- 
ally very demanding and not suitable for our purposes since 
it is model dependent and we have no a priori knowledge of 
the scale of voids. On the other hand, Voronoi tessellations 
can be constructed relatively fast and are easy to manip- 
ulate, so that the behavior of relevant parameters can be 
studied systematically in a more efficient way. For example, 
the scale of voids is an input parameter. 



2.2 Mock Surveys 

Formally a Voronoi tessellation is defined as a set of regions 
fl 8 of nucleus i. For a nuclei with coordinates X t the region 
fl 8 is defined as the set of all points which are nearer to X t 
than to any other Xj, j ^ i. Hence, Voronoi regions are 
convex polyhedra (3-D) with finite size. 

We developed a new version of a numerical Voronoi par- 
ticle code to construct a galaxy distribution in a 3-D peri- 
odic box. A random distribution of expansion centers is the 
starting point of this procedure. Their number density n s of 
nuclei determines the mean size of the voids, which is given 
by d s = 2(3/47r) 1/3 «r 1/3 . 

We restrict ourselves to a Poisson distribution of seeds. 
This is a plausible approximation if the present-day voids 
evolved from density minima in a Gaussian random phase 
density fluctuation field. The voids that have evolved to the 
shell-crossing phase at the present epoch correspond to den- 
sity minima on scales where the initial correlations were not 
very large or even negligible. Although we also could have 
considered correlated and anticorrelated seed distributions, 
we chose not to do so. Firstly, for moderate to negligible seed 
clustering the correlation properties of points in Voronoi 
walls and filaments, as well as of Voronoi vertices, differs 
only marginally from that in Poisson Voronoi tessellations 
(Van de Weygaert 1991b). Furthermore, it is important to 
emphasize that even in the case of Poisson seeds there will 
be a distribution of void sizes. Consideration of correlated 
or anticorrelated seed distributions would introduce an ad- 
ditional scale into the problem, which for correlated seeds 
would lead to a broadening of the distribution of voids' sizes 
in the Voronoi foam and as consequence the distribution of 
zero-crossings. We believe that in order to properly address 
the main issue of whether £ can pick up a characteristic 
scale in a non-regular void distribution when errors of fi- 
nite sampling and survey geometry are taken into account 
it is essential to investigate simple models. Consideration of 
a correlated seed distributions would only unduly compli- 
cate the problem without necessarily being a more realistic 
model. 

Subsequently, the galaxies are inserted into the network, 
their number density being determined by the galaxy lumi- 
nosity function (see below). The position of each galaxy is 
determined as follows. A point is inserted at a random posi- 



tion in the simulation box, and we determine the closest ex- 
pansion center to this point. By definition the inserted point 
is situated within the Voronoi cell of this 'nucleus'. Subse- 
quently, we determine geometrically the position, closest to 
the nucleus, on the line passing through the nucleus and the 
point that is as close to this nucleus as to some other ex- 
pansion center in the box. Evidently, this position is within 
the bisecting plane, and more specifically the Voronoi wall, 
between the two expansion centers. We put a galaxy at this 
position in the wall. By repeating this procedure for the 
required number of points, each wall of the Voronoi polyhe- 
dra gets filled with a random distribution of galaxies, each 
wall having a uniform surface number density whose value 
is linearly proportional to the distance between the defining 
expansion centers. Notice that this also implies the absence 
of power on small scales. 

Although the code has been developed to allow for more 
sophisticated cases, by introducing more realistic distribu- 
tions in which the vertices of the tessellations are identified 
with clusters, and the edges of the walls are identified with 
filaments, in most of our simulations we limited ourselves 
to the simple case in which the galaxies are randomly dis- 
tributed on the walls. By an additional displacement of each 
of the galaxies perpendicular to their walls, the amplitude 
of the displacement determined by sampling from a distri- 
bution function (in our case Gaussian or uniform), the walls 
are given a finite thickness. We have adopted a typical wall 
thickness of 500 kins -1 . Tests have indicated that our con- 
clusions are not sensitive to the thickness of the walls. 

In most of our Monte Carlo experiments we adopted a 
single (average) void scale. We have used voids with typical 
sizes of 25, 50 and 100 ft -1 Mpc, which cover the range of 
scales of interest. A more realistic scenario for the universe, 
as pointed out by several authors (Dubinski et al. 1993, Van 
de Weygaert 1991b, Van de Weygaert & Van Kampen 1993, 
Sahni et. al. 1994), would be a hierarchy of void sizes. Al- 
though, in principle we could attempt to simulate such struc- 
ture the lack of dynamics in our construction would lead to 
an artificial distribution of expansion centers and therefore 
to unrealistic models. Moreover, we are primarily interested 
in the typical scale of the void distribution which we expect 
will also have the most prominent ridges. 

In order to simulate real observations galaxies were as- 
signed magnitude according to a Schechter luminosity func- 
tion given by tp 

v=v '{^Y e ~ L/L ' ■ (i) 

In our calculations the parameters for the luminosity func- 
tion were taken, whenever possible, from the determina- 
tion of the original authors. Only galaxies brighter than 
Mmin = —17 were considered since as we discuss below we 
consider only semi- volume limited samples in the calculation 
of the two-point correlation function. This bright cutoff was 
also necessary for the simulation of the deep surveys, as the 
number of galaxies increases dramatically. 

Our Monte Carlo simulations consist of choosing a typi- 
cal void size, generating a tessellation in a box of size appro- 
priate for the depth of the survey considered and assigning 
magnitudes according to equation (1). Assuming that the 
observer is located in the middle of the box we extract for 
each Voronoi tessellation a sample with the same geometry, 
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Figure 1. A 20 h~ 1 Mpc thick cut of a Voronoi tessellation in a 
3-d box of 350 h~ 1 Mpc to the side. The galaxies were selected 
using the same selections as in the SSRS2 survey (see section 3.1). 



effective and apparent magnitude limits as the survey con- 
sidered. We used 1000 realizations for each case we studied. 
In figure 1 we show a typical cut of such a tessellation with 
cells of a characteristic size of 50 h' 1 Mpc using the same 
selection criteria as those of the SSRS2 survey (see section 
3.1). 



2.3 The Two-point Correlation Function 

We define £(r) as the probability in excess of Poisson distri- 
bution of finding a galaxy in a volume 8V a distance r, away 
from a randomly chosen galaxy, 



8P = n8V [l + f(r)] 



(2) 



where n is the mean number density of galaxies. 

In order to calculate £ we generate a random sample 
with the same selection criteria as that of the Voronoi tes- 
sellation. We then use the estimator 



Ndd(t) n R 
Nbr(t) nr> 



1 



(3) 



where Nbd and Nbr refer to the number of data-data and 
data-random pairs, respectively, at a separation r±8r (8r <C 
r). This estimator is unaffected by the geometry of the slices 
since the random sample has the same geometry and suffers 
from the same boundary effects. The estimator in Equation 
(3) can be generalized to include weighting functions. The 
weight can depend both on the distance of the objects from 
the origin, and the distance of the objects from each other. 
We chose to weigh the number of pairs, N, and the mean 



number density of the galaxies, n, by the selection function 
4>(t). We define (f>(r) as 



4>{r) 



r . <p{L)dL 

■J Lmin •■ ' 



(4) 



where L(r) is the minimal observed luminosity at radius 
r and Lmin is the luminosity that corresponds to M mtn . 
Specifically we use 



1 



(5) 



4>(ri)4>(rj) 

where xy stands for DD or DR and the sum is over all the 
galaxies separated by a distance r ± Sr. The density n was 
estimated as 



1 V- 

V -W(r 8 ) 



(6) 



where V is the volume of the survey (e.g. Davis and Huchra 
1982). 

We have also performed some calculations using the es- 
timator 



w x NddNrr 
t( r ) = —2 1 



(7) 



proposed by Hamilton (1993). This estimator is expected to 
be less sensitive to uncertainties in the mean density. 



3 RESULTS 

Utilizing the mock surveys we determine the errors in the 
correlation function due to finite sampling, sampling fluctu- 
ations and finite volume. We also examine the distribution 
of the first zero-crossing of the correlation for the differ- 
ent surveys considered below. As illustrations we have con- 
sidered in our study the following examples: dense, wide- 
angle surveys like the combined CfA2 and SSRS2 (Geller 
and Huchra 1989, da Costa et. al. 1994), moderately deep 
but sparse surveys like the Stromlo-APM Survey (Loveday 
et. al. 1992) and deep, thin and almost complete surveys 
like the Las Campanas Survey (Schectman et. al. 1992). We 
have chosen these surveys because they are either complete 
or near completion and represent the range of strategies re- 
cently adopted by different authors. We have not included 
in our analysis the deep pencil beam surveys like that of 
Broadhurst et. al. (1990) since their geometry is not as eas- 
ily accommodated to our existing code. 

In all the calculations of the correlation function pre- 
sented below we have semi-volume limited the samples an- 
alyzed, considering only galaxies with absolute magnitudes 
brighter than M = -18.5. We have also only considered galax- 
ies within a maximum distance where the selection function 
drops to about 0.1. Hereafter we refer to that distance as 
being the "effective" depth of the survey. 



3.1 Surveys 

In our analysis we have not attempted to reproduce ex- 
actly the regions and all the details of the selection criteria 
adopted in the surveys considered but only the more gen- 
eral characteristics like the approximate sky coverage and 
limiting magnitude. 
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To represent the geometry of the nearby dense sur- 
veys we take the geometry of the combined CfA2 north + 
south samples, the recently completed SSRS2 of the south- 
ern galactic cap and the ongoing survey of the northern 
galactic cap south of 8 < 0°. This sample consists of about 
18000 galaxies brighter than tob(o) = 15.5 covering roughly 
1/3 of the sky to an effective depth of about 130 h~ Mpc. 
The missing chunk of sky in the northern galactic cap should 
be completed in the near future. Below we refer to a ge- 
ometry like the SSRS2 (South) as S, with a volume of 
8 x 10 5 h~ 3 Mpc 3 and a density of 5 x 10 -3 galaxies h 3 
Mpc -3 , and the combined sample SSRS2 (South+North) 
and CfA2 (South+North) as SNSN, with the same density 
but 4 times the volume. 

Although the parameters describing the Schechter lu- 
minosity functions for the CfA2 and the SSRS2 have been 
found to differ (Marzke et. al. 1994, da Costa et. al. 1994) we 
have for our purposes adopted a single luminosity function, 
using the parameters appropriate for the SSRS2 survey with 
M* = -19.6, a = -1.2 and ip* = 0.97 x 10" 2 h 3 Mpc" 3 . 

As a second example we consider the Stromlo-APM Sur- 
vey (Loveday et. al. 1992, hereafter APM), which randomly 
selects galaxies at a rate of l-in-20 drawn from a complete 
magnitude-limited sample in the range 15 < bj < 17.15. 
This survey covers an area of 4300 square degrees and ex- 
tends out to about 40000 kins -1 . The Voronoi tessellations 
for this case were generated in a 3-D box 800 h' 1 Mpc 
to the side, The parameters for the Schechter luminosity 
function were taken to be M* = —19.5, a = —0.97 and 
ip* = 1.4 x 10 -2 h 3 Mpc -3 , as determined by Loveday et. 
al. (1992). During the process of generating the tessellation 
galaxies were randomly discarded at the same sampling rate 
adopted in this survey. In the calculation of the correlation 
function we further limited the sample to a distance of about 
290 ft -1 Mpc. This survey has a volume of about W 7 h~ 3 
Mpc 3 and a density of 1.7 x 10 -4 galaxies h 3 Mpc -3 , about 
30 times less than the dense surveys. 

As a final example we consider the Las Campanas Sur- 
vey (Schectman et. al. 1992, hereafter LC) of galaxies in the 
magnitude range 17.5 < tob(o) < 19-2 in thin slices. This 
survey was designed to probe scales where one would expect 
the distribution of galaxies to reach homogeneity. The sur- 
vey extends out to 60000 kins -1 , probing several structures 
with characteristic sizes as large as 100 ft -1 Mpc. In the 
present analysis we only consider the geometry of a thin but 
completely filled-in strip 120° x 1.5° as the one currently 
completed. Since the luminosity function for this survey is 
not available we have adopted the Schechter parameters de- 
rived for the SSRS2. In the calculation of the correlation 
function the maximum distance considered is of about 600 
h' 1 Mpc. In our analysis we neglect other known selection 
criteria such as the cut in surface brightness and the incom- 
pleteness of the survey in dense regions due to the finite 
number of fibers available for each field. The volume of this 
survey is roughly the same as that of SNSN with about the 
same density of galaxies. 



3.2 Discussion 

In figure 2 we present in the different panels the mean cor- 
relation function and the errors derived from 1000 Monte 
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Figure 2. For each of the geometries described in the text 
we show the mean correlation function derived from 1000 semi- 
volume limited samples. In this example the diameter of the voids 
is 50 h- 1 Mpc. 



Carlo simulations for each of the surveys described above 
with voids of size 50 ft -1 Mpc. As expected the mean cor- 
relation obtained from 1000 realizations agrees remarkably 
well with the all-sky "sample" and with the true correla- 
tion function computed from the 3-D Voronoi tessellation 
free of luminosity and geometrical effects. In particular, the 
first zero-crossing of the mean correlation function occurs 
at roughly half of the scale of the typical void imposed in 
the simulation. Although we only show the case for voids 
with a typical diameter of 50/t -1 Mpc, similar results were 
obtained for the other void sizes. 

The Monte Carlo simulations allow us to estimate the 
amplitude of the error we may expect in the determination 
of the correlation function from real data due to finite sam- 
pling. The errors were computed from the variance of the 
correlation function at each separation obtained from the 
1000 realizations considered and includes contributions from 
shot-noise, sampling fluctuations as well as other systematic 
errors due to the finite volume considered. The amplitude 
of the error, represented by the error bars in figure 2, is 
shown in figure 3 for the different surveys considered and 
the different scales of inhomogeneities. In all cases the er- 
ror is relatively large and comparable to the amplitude of 
the correlation function in the region where it becomes neg- 
ative (anti-correlation), especially for large voids (100 ft -1 
Mpc). It has been for this reason that the two-point corre- 
lation function has been discarded as a useful statistic to 
characterize the clustering properties on large scales. Note, 
however, that this depends on the size of voids and the ge- 
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Figure 3. The amplitude of the errors in the mean correlation 
function for each of the geometries described in the text. With 
25 h~ 1 Mpc voids (dashed line), 50 h~ 1 Mpc voids (dotted line), 
100 h- 1 Mpc voids (solid line). 
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Figure 4. The distribution of the first zero crossing for each of the 
geometries described in the text with 25 h~ 1 Mpc voids (dashed 
line), 50 Mpc voids (dotted line), 100 Mpc voids (solid 

line). For each case the distribution is from 1000 simulations. 



ometry of the survey. For voids less than 50 ft -1 Mpc all 
geometries lead to comparably small errors. On the other 
hand, for larger voids the error is significantly larger and 
only the APM survey seems to be suitable to measure the 
zero-crossing with some degree of confidence (see discussion 
below). 

The ability to detect the scale of the voids from the 
correlation function as measured from the data should de- 
pend on the probability distribution of the first zero-crossing 
taking into account the various sources of errors. To inves- 
tigate in more detail the statistical properties of this quan- 
tity we have computed the distribution of zero-crossings ex- 
pected for Voronoi tessellations with typical void sizes of 25, 
50 and 100 ft -1 Mpc. The resulting distributions, obtained 
from 1000 realizations for the different survey geometries, 
are shown in figure 4. The effect of the void size relative to 
the total volume can be clearly seen as the distribution for 
small voids is relatively well defined and close to a Gaussian. 
As we increase the void size the distribution broadens and 
overlaps the distribution obtained for larger voids. In some 
cases, the maximum of the distribution also shifts to smaller 
values leading to an underestimate of the true void size. This 
is a noticeable effect for surveys with small volumes such as 
the case shown in the upper-left panel for voids 100 ft -1 Mpc 
in diameter. 

We expect that the effectiveness of a given survey to 
determine the typical size of voids from the correlation func- 
tion depends primarily on how well defined the distribution 
of the zero-crossings are, for the different void scales, and 



the relative overlap between them. Using this as a qualita- 
tive measure we immediately see that both the dense, wide- 
angle, nearby survey (SNSN) and the APM survey are more 
suitable to detect the scale of voids than the LC survey. 
Since the SNSN and LC have approximately the same vol- 
ume the behavior of the zero-crossing distribution indicates 
that the critical characteristic of the survey is not only its 
volume but also its solid angle, which combined determine 
how many independent voids are fully contained in a given 
survey. A two-dimensional survey, like the LC survey, al- 
though deep, picks up different scales as only a cross-section 
of the distribution is seen and different parts of the voids 
are seen (see figure 1). This is a direct consequence of stere- 
ological considerations (see e.g. Stoyan, Kendall & Mecke 
1987) from which we know that the size of cells in infinitely 
thin 2-D cross-sections through 3-D tessellations is smaller 
than the size of the corresponding 3-D cells. In particular 
it can be shown that the cell volume distribution in a 3-D 
Voronoi tessellation has a considerably different character 
than the size distribution of the 2-D cells in a cross-section 
through that tessellation (Van de Weygaert 1991b, 1994). 
While the 3-D cell volume distribution is essentially a broad 
peak centered on the average (typical) cell size, the sectional 
cell area displays an extended distribution with an average 
size significantly lower than the average size of its 3-D coun- 
terparts (by a factor of PS 0.8), a long tail towards large sizes, 
and an important and increasing contribution towards small 
section sizes. Dependent on the survey geometry, and in par- 
ticular the solid angle they occupy on the sky, the surveys 
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Figure 5. The distribution of the first zero crossing as a function 
of x. For each of the geometries described in the text with 25 h~ 1 
Mpc voids (dashed line), 50 Mpc voids (dotted line), 100 
Mpc voids (solid line). For each case the distribution is from 1000 
simulations. 
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Figure 6. The cumulative distribution of the first zero crossing 
as a function of x. For each of the geometries described in the 
text with 25 h~ 1 Mpc voids (dashed line), 50 h~ 1 Mpc voids 
(dotted line), 100 h~ 1 Mpc voids (solid line). For each case the 
distribution is from 1000 simulations. 



discussed in this paper exhibit behavior that lies somewhere 
in between the extremes of a full three-dimensional survey 
and an infinitely thin cross-section. It is clear that the 2-D 
surveys are less sensitive to the true scale of the voids, and 
that a survey with a larger solid angle is better suited to 
constrain the size of voids. 

To better understand the nature of the distributions 
shown in figure 4 we present them in a more convenient 
form in figure 5, where we plot the frequency distribution 
/ = N crosstng /Ntotai as a function of x = log(R zer o/ Rvoid) 
to eliminate the scaling factor and allow a direct comparison 
between them. Here R zero is the scale of the zero-crossing 
and R V oid is the radius of the typical void. From the figure 
it is clear that surveys probing small volumes (case S) the 
zero-crossing distribution of large voids is shifted to smaller 
scale, which leads to a systematic underestimate of their 
sizes. It is also clear that for the SNSN the distribution is 
somewhat skewed to large zero-crossings, while APM and 
LC are more symmetric, the latter being, however, broader 
then the rest. It is interesting that for the APM the dis- 
tribution of zero-crossing for the small voids is significantly 
broader than in the case of more dense surveys. A possi- 
ble explanation for this is that for a total fixed number of 
galaxies, a distribution consisting of small voids will have 
too few points per structure causing the small structures to 
be poorly delineated yielding a broader distribution. This is 
apparently the main limitation of a sparse survey. 



In order to compute confidence intervals we calculate 
the cumulative distributions Pi and P2 defined as 

Pi(x < zi) = / f(x')dx' (8) 



and 

f CO 

P 2 (x >x 2 ) = / f(x')dx' (9) 

which measure the probability for x to be less than a speci- 
fied value x\ and for x to be larger than x 2 , respectively. In 
figure 6 we display Pi and P 2 for all the curves of figure 5. 

From the curves shown in figure 6 we can estimate the 
95% confidence interval for the zero-crossing scale given the 
void size and geometry. This was done by determining the 
values for x\ and x 2 for which Pi and P 2 were equal to 
0.025. From these values we derive the intervals given in 
table 1. From the table we see that in all cases the samples 
are sufficiently large to discriminate between voids of 25 and 
50 ft -1 Mpc. However, only the APM and marginally the 
SNSN can be used to discriminate between 50 and 100 ft -1 
Mpc voids at this confidence level. 



8 Goldwirth, da Costa & Van de Weygaert 

Table 1: 95% Confidence Interval for the Zero-crossing Scale. 



S SNSN APM LC 



Void size 

25 10.4-18.3 11.9-15.5 6.6-18.8 12.1-16 

50 18.9-66.6 22.73-40.5 20.5-35.4 22-57.7 

100 28.8-71 37.4-102.4 38.5-104.4 37-157 



zero-crossing may be more difficult to measure in combined 
samples like the SSRS2 and CfA2 because of possible prob- 
lems of non-uniformity in the selection criteria and because 
of effects introduced by redshift distortions. This is still be- 
ing investigated and we hope to be able to report on that in 
a future paper. 



4 CONCLUSIONS 

In this paper we have investigated the possibility of deter- 
mining the typical scale of voids from a simple statistic given 
by the first zero-crossing of the two-point correlation func- 
tion. The nice feature of using the properties of the two-point 
correlation function is that it is simple and adequate to de- 
scribe a point distribution as it does not impose any scale 
as other statistics do. It also has a very clear signature for 
regions devoid of galaxies. 

We have derived the probability distribution obeyed by 
this quantity for different redshift surveys using a Monte 
Carlo technique which should incorporate random as well as 
systematic errors to which the data is normally subjected. 
Our main conclusion is that despite the large errors in the 
determination of the correlation function on large scales ( > 
20 h' 1 Mpc), the zero-crossing statistic may be a useful tool 
in determining the scale of typical voids under the assump- 
tions that: 1) the galaxy distribution is void-filled; 2) that 
there is a characteristic scale for the void distribution; 3) 
that the Voronoi tessellation is a suitable model for describ- 
ing the gross features of the large scale structure. These as- 
sumptions although reasonable and expected from the visual 
inspection of the current redshift maps are by no means def- 
inite. In particular, we have not proven that the universe is 
void-filled. However, the existence of large coherent features 
and the fact that a large fraction of the observed volume 
is empty argues in favor of this "hypothesis'. We also note 
that for comparing our results to real data one must take 
into account the effects of redshift distortions and a valid 
comparison may require calculating the real space rather 
than the redshift space correlation function. 

Our most important conclusion is that the existing sur- 
veys are sufficiently large to allow us to detect the scale 
of typical voids in a void-filled Universe. In particular, the 
Stromlo-APM survey should provide us with a good test to 
check the claims of Broadhurst et. al. (1990) for the existence 
of voids as large as 128 ft -1 Mpc. 
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The same analysis was performed using the Hamilton 
(1993) estimator defined above for the nearby surveys S and 
SNSN. In general, we find that the zero-crossing distribution 
is essentially unchanged except for the large zero-crossing 
tail which is not as extended, leading to a slightly narrower 
intervals than those listed in table 1. A major difference was 
found only for the case of voids 50 ft -1 Mpc in the S case. 
This is probably because this is the most sensitive to the 
uncertainties of the mean density. Still we believe that for 
comparison with data this estimator should be preferred. 

We can also use these probability distributions in a 
slightly different way and ask, given a measurement of R zer o, 
what are the lower and upper limits for the size of typical 
voids. This can be done by assuming that the cumulative 
distributions Pi and P2 are always, for a given geometry, 
similar to those obtained for the 25 ft -1 Mpc and the 100 
ft -1 Mpc voids, respectively. With this assumption and given 
a measured value for R zer o we estimate the 95% confidence 
interval for the typical diameter of voids for each geometry 
considered. These intervals are presented in table 2. 



Table 2: 95% Confidence Interval for D vo %dj Rzero 



s 


SNSN 


APM 


LC 


1.37-3.51 


1.61-2.67 


1.33-2.60 


1.56-2.70 



For the SSRS2, for instance, preliminary results suggest 
Rzero ~ 38 ft -1 Mpc which, from table 2, implies that the di- 
ameter of the typical void is in the range 52 to 133 h' 1 Mpc 
at a 95% confidence level. Unfortunately, the range is suffi- 
ciently large not to allow to discriminate between the voids 
seen in the nearby surveys and those claimed by Broadhurst 
et. al. (1990) from deep pencil-beam surveys. However, if a 
similar value is obtained for the other surveys we would be 
able to set some useful constraints on the largest possible di- 
ameter for voids. In particular, if the measured zero-crossing 
is less than about 45 ft -1 Mpc we would be able to reject 
at a 95% confidence level the existence of typical voids 128 
ft -1 Mpc in diameter. We hope that these considerations 
will serve as an incentive for the different groups to publish 
the values obtained for the zero-crossing. We point out that 
the zero-crossing may be slightly more difficult to measure 
in combined samples like the SSRS2 and CfA2 because of 
possible problems of non-uniformity in the selection crite- 
ria. This is still being investigated and we hope to be able 
to report on that in a future paper. We point out that the 
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